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ABSTRACT 

The paper descibes a compact and highly efficient 
disk file management system responsible for the management and 
allocation of space on moving head disK drives in a medium-scale 
time-sharing system. The disk file management system is a major 
component of the Exp-^rimental Time-Snaring System (ETSS) developed 
the Learning Research and Development Center. ETSS has been 
successfully operating for nearly two years and is a multilanguage 
general-purpose time-sharing system based on Digital Equipment 
Coporation's PDP-15. (Author) 
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Abstract 



This paper describes a compact and highly efficient disk file man- 
agement system responsible for the management and allocation of space 
on moving head disk drives in a medium-scale time-sharing system. 
The disk file management system is a major component of the Experi- 
mental Time-Sharing System (ETSS) developed at the Learning Research 
and Development Center. ETSS has been successfully operating for 
nearly two years and is a multi-language, gene ral - purpose time - sharing 
system based on a DEC PDP-15, 
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Learning Research and Development Center 
University of Pittsburgh 

Introduction 

Most commercially available small machine time -sharing systems 
offer rudimentary or special-purpose disk file management software. 
Seme systems require, the programmer to be aware of the underlying 
disk structure and o^ten place the responsibility for record packing 
within allocated disk space upon the program. More commonly, the 
file management software is special-purpose and is tailored to meet 
the requirements of a single language such as BASIC or a special appli- 
cation such as hospital record keeping. The file management system 
described in this paper is a demonstration that a powerful, general- 
purpose disk file management system offering large system features 
and capabilities is possible on a smaller computer with only United 
memory. 

Called DFM for Directory File Management, the system is the 
successor to two earlier disk file management systems, each based on 
inverted list principles and designed for fixed-head disk environments. 
The current DFM employs list structures and is designed for moving 
head devices although fixed head units can be supported as well. DFM 
in its present form has been in use for ten months and is a major com- 
ponent of the Experimental Time -Sharing System (ETSS) developed at 
the University of Pittsburgh, ETSS has been operational for nearly two 



years and is a multi -language , general-purpose time-sharing system 
based on a Digital Equipment Corporation PDP-15. 

DFM maintains a complete and self-contained directory structure 
on each disk unit so that packs may be mounted and dismounted at will ., 
independent of all others. Packs are named providing the programmer 
with the option of specifying a pack by name or simply permitting the 
file operation to default to the one or more resident system packs. Con- 
sistent with the device independent features of ETSS, DFM allows users 
to read, write, and manipulate data stored in named datasets with no 
regard for the physical properties of the device. Each dataset may con- 
tain any number of variable length records separated by any number of 
file marks. Datasets may be temporary, or they may be "cataloged" 
with protection keys in the user's private directory, in the system 
"library" available to all or in the directory of another user if permitted. 
Features are provided t, enable multiple users to simultaneously access 
and update common datasets without the risk of interference. 

Desc ription 

Device Characteristics . In the current implementation, DFM 
operates on IBM 2 3 14-equivalent disk drives using 2316 or equivalent 
disk pacKS. The hardware established minimum unit of transfer is a 
256 18-bit word sector. There are 200 sectors per cylinder and 200 
cylinders per pack for a total of 40,000 sectors or 10. 24 million words. 
This translates to approximately 25 million characters of storage on 
a single pack. The average rotational delay of the unit is 12. 5 milli- 
seconds, and head seek timet, range from 20-80 milliseconds with an 
average of 50 millit^econds. 



Although DFM was designed for 2314-equivalent units, the specific 
device details are important only on the device handler level. A variety 
of disks can be supported including fixed head as well as moving head 
units. 

Space Allocation and Deallocation , It is clear from the device 
performance statistics that head movement is a major component of 
total transfer time and that excessive or unnecessary head movement 
will sharply degrade overall device performance. This is particularly 
true for the space allocation and deallocation processes. As a result, 
a primary design objective was to minimize the numbei of head move- 
ments required to locate available space and to return space that is no 
longer needed. In addition, the allocation algorithm was designed to 
allocate space in such a way as to minimize head movement during sub- 
sequent file processes. 

The minimum unit of allocation and deallocation is an implemen- 
tation-defined block which, in the current release of ETSS, is two disk 
sectors or 512 18-bit words. Two different types of bit nnaps are used 
to monitor the allocated/unallocated status of the 20,000 blocks on each 
pack. A "Cylinder Bit Map** is maintained on each cylinder which reflects 
the allocated/unallocated status of eacn block on that cylinder. As there 
are 100 blocks on a cylinder, the bit map requires 100 bits or only six 
18-bit words. When a new pack is mounted, each of these cylinder bit 
maps is scanned and a memory resident '*Bit Alap of Bit Maps" is con- 
structed for the pack. Each bit in this memory resident bit map reflects 
the status of a cylinder bit map and indicates whether or not there is 
unallocated space on the cylinder. Only 12 18-bit words are required 
for this bit map. 
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When a disk block is required, an attempt is made to allocate 
space from the cylinder on which the disk heads are currently positioned. 
If this is not possible and space must be found r another cylinder, the 
memory resident bit map of bit maps is examined to avoid unnecessary 
accesses of cylinder bit maps in which there is no unallocated space. 
The lowest numbered cylinder with unallocated space is selected, the 
heads are moved to that cylinder and the cylinder bit map is read into 
memory. The two levels of bit maps ensure that no more than one head 
movement is ever required to allocate space. The cylinder selection 
strategy eliminates the scattering of allocated blocks and reduces head 
movements during subsequent file processes. 

Space deallocation is a background process and is permitted to 
occur only when no other disk file processes are active.. When a block 
is to be deallocated, the appropriate cylinder bit map is read into 
memory, and the bit corresponding to the block is reset. The bit map 
of bit maps is updated also if the cylinder was fully allocated prior to 
the deallocation. 

Primary Data Structure . The primary and sole data structure on 
disk is a noncircular bidirectional list of blocks with each block con- 
taining "records" linked in a circular, bidirectional list within the block. 
Shown in Figure 1, this data structure is used in both directories and in 
datasets and is uniform and consistent on all levels. Asa result, DFM 
is relatively small in size because only a few, very compact subroutines 
are required to create and manage this single data structure. 

The first two words of each block in a list serve as pointers to the 
preceding and following blocks. Bidirectionality enables block lists to 
be scanned in either direction and permits validity checks to be per- 
formed on all links during normal file operations. To eliminate the 
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requirement that the backward link of the first block in a list be updated 
each time a new block is added to the end of the list, block lists are 
noncircular. A null value is used to indicate that a block is either the 
first or the last block in a list. 

Records within a block are linked in a circular, bidirectional list 
with each record preceded by a pair of list pointers. Circularity is 
possible since the entire block is read into- memory and can be examined 
and updated at one time. Any unused space within a block appears as 
the last record in the list. 

Directory Structure , The DFM directory structure consists of 

1 

three interlinked levels and is depicted in Figure 2. For each pack, 

I 

there is a single "Master Directory" of user ID's and other user specific 
information, "User pirectories" of dataset names and other dataset 
specific information and "Datasets" containing "records" written to the 
disk by user programs. The master directory consists of one or more 
linked blocks that contain the identification numbers of all users who 
have user directories on the pack, other information about those users 
and pointers to each user directory. The ID of a user is added auto- 
nnatically to this master directory when the user catalogs a dataset on 
the pack. This ID entry is removed if all the datasets in the associated 
user directory are deleted. 

A user directory is created when a new user catalogs a dataset 
on the pack and is automatically destroyed when all the datasets within 
the user directory have been deleted. As with the master directory, a 
user directory will expand and contract in size as datasets are cataloged 
or deleted. An entry in a user directory consists of a dataset narnc, 
information about the dataset and a pointer to the first block of the data- 
set. This entry is removed when the dataset is deleted by the user. 



A dataset consists of a series of vari^ible length records stored 
within blocks. A Ithougii a dataset m^y Vontain as many records as can 
be stored on a disk pack, no record can be longer than a single block 
less the link words. The manner in which records are stored within 
blocks is transparent to the usey who merely reads and writes records 
completely unaware of the underlying block structure of the pack, 

* f 

Ope ration 

File Processing . When a program indicates that it wishes to 
access a cataloged dataset, the master directory of the specified pack 
is searched for the appropriate ID. If it is foivhd, the pointer to the 
associated user directory is retrieved, and the user directory is 
searched for the dataset name. If the name appears in the directory, 
an entry is created and is added to a memory resident list called the 
"Active List. " Each entry in the active list contains the name of a 
dataset currently being accessed, information about the dataset and the 
user accessing it, and a set of values identifying the current position 
of the user within the dataset. These include the block numbers of the 
prior, current and next blocks in the dataset and the position of the cur- 
rent record within the current block. These values are updated during 
normal file processing as the user moves within the dataset through 
file manipulation requests or by reading and writing records. 

The memory resident active lists enables file requests to be 
satisfied quickly without time consuming directory or dataset searches, 
A read request, for example, requires only a single transfer. The 
memory resident entry uniquely identifies the disk block where the next 
record resides and the position of the record within the block. A write 
request usually requires only two transfers, the first to read in the cur- 
rent block and the second to rewrite it to disk with the newly added recor 
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Less frequently, when an additional block must be allocated, a write 
request will require from throe to live transfers depending upon the_ 
cylinder bit map and disk block already in memory. 

Dataset Deletion , When a data set is to be deleted, the user 
directory fentry for the dataset is removed first. If no names remain 
in the user directory, the complete user directory is deleted as well 
as the associated ID in the master directory. The list of blocks con- 
taining the dataset is then linked to the end of the disk resident deleto 
list in preparation for .he background deallocation process. From the 
user's point of view, the dataset is effectively dele'ed at this time, even 
though actual deallocation has not yet begun. During intervals in which 
no other requests are pending, successive blocks are delinked from the 
delete list, and the corresponding bits in the cylinder bit mapj are reset. 

Directory Integrity F'ollowing a Crash . The directory structure 
is relatively impervious to the effects of disl^ failures or system crashes. 
All directory creation and modification operations are sequenced so 
that an unexpected interruption will not result in directory inconsisten- 
cies or doubly allocated space. For example, when a new block must 
be allocated for a dataset, the new block is allocated and is written to 
disk with the appropriate backward link before the forward link of the 
prior block is updated. Were this sequence interrupted, the worst out- 
qome would be allocated space that is outside the directory structure 
and is unaccounted for. The directory structure remains intact. 

Although the directory structure could be disrupted by a hardware 
or power failure which interrupts an on-going disk transfer, operating 
experience with the current as well as prior versions of DFM has 
demonstrated that a well-designed disk structure can be made sub- 
istanlially crash-proof if directory creation and update operations are 
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properly sequenced. In nearly two years of daily operation, no dataset 
or directory has ever been disrupted or lost on ETSS, 

Conclusion 

DFM is an operating demonstration that a powerful, general- 
purpose disk file management system 13 feasible on a smaller computer 
with limited memory. In ten months of extensive use, the file manage- 
ment system has met ox exceeded all major performance objectives 
and is currently managing over 7000 on-line datasets on the resident 
system packs. The primary data structure of lists of blocks and lists 
of records \yithin blocks have proved to be sound and particularly well- 
suited for small -machine operating systems. 
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